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Abstract 

In the fc-edge-connected spanning subgraph problem we are given a graph (V, E) and costs for each 
edge, and want to find a minimum-cost F C E such that (V, F) is fc-edge-connected. We show there is a 
constant e > so that for all k > 1, finding a (I + e)-approximation for fc-ECSS is NP-hard, establishing 
a gap between the unit-cost and general-cost versions. Next, we consider the multi-subgraph cousin of 
fc-ECSS, in which we purchase a multi-subset F of E, with unlimited parallel copies available at the 
same cost as the original edge. We conjecture that a (1 + 0(l/fc))-approxfmation algorithm exists, and 
we describe an approach based on graph decompositions applied to its natural linear programming (LP) 
relaxation. The LP is essentially equivalent to the Held-Karp LP for TSP and the undirected LP for 
Steiner tree. We give a family of extreme points for the LP which are more complex than those previously 
known. 

1 Introduction 

In the fc-edge-connected spanning subgraph problem (fc-ECSS), we are given an input graph G with edge 
costs, and must select a minimum-cost subset of edges so that the resulting graph has edge-connectivity fc 
between all vertices. This is a natural problem for applications, since it is the same as seeking resilience 
against (fc — 1) edge failures, or the ability to route k units of flow between any pair of vertices. A natural 
variant of fc-ECSS is to allow each edge to be purchased repeatedly, as many times as desired, with each 
copy at the same cost. We call this the k-edge-connected spanning multi- subgraph problem (fc-ECSM). 

When k — 1 the fc-ECSS and fc-ECSM problems are both equivalent to the minimum spanning tree 
problem, which is well-known to be solvable in polynomial time, but they are non-trivial for fc > 1. We 
consider approximation algorithms for these problems: an algorithm that approximately solves fc-ECSS or 
fc-ECSM is said to be an a -approximation, or have approximation ratio a, if it always outputs a solution 
with cost at most a times optimal. 

Here we survey the oldest and newest results for fc-ECSM and fc-ECSS. Frederickson & Jaja gave a 2- 
approximation algorithm for 2-ECSS [23], and a 3/2-approximation in the special case of metric costs [24]. 
A 3/2-approximation is possible for 2-ECSM [9]. For fc-ECSS/fc-ECSM in general, Khuller & Vishkin [32] 
gave a matroid-based 2-approximation, and Jain's iterated LP rounding framework [31] also gives a 2- 
approximation. Goemans & Bertsimas [29] give an approximation algorithm for fc-ECSM with ratio | when 
fc is even, and (| + ^) when fc is odd. Fernandes [21] showed 2-ECSS is APX-hard, even for unit costs. 

An important special case is where all edges have unit cost. Then fc-ECSS gets easier to approximate 
as fc gets larger: Gabow et al. [26] gave an elegant (1 + 2/fc)-approximation algorithm for fc-ECSS/fc-ECSM 
using iterated LP rounding, and they showed that for some fixed e > 0, for all fc > 1, it is NP-hard 
to get a (1 + e/fc)-approximation algorithm for unit-cost fc-ECSS. Together, these establish a 1 + 0(l/fc) 
approximability threshold for unit-cost fc-ECSS. Improvements to the constant, and improvements in the 
special case that the input graph is simple, appear in Cheriyan & Thurimella [15] and Gabow & Gallagher [25]. 
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1.1 Contributions 



1.1.1 Hardness Results (Section 2) 

Our first main result is the following hardness for fc-ECSS: 

Theorem 1. There is a constant e > so that for all k > 2, it is NP-hard to approximate k-ECSS within 
ratio 1 + e, even if the costs are 0-1. 

Although e « here is small, the qualitative difference is important: whereas the approximability of 
unit-cost fc-ECSS tends to 1 as A: tends to infinity, we see that the approximability of general-cost fc-ECSS 
is bounded away from 1. 

Next we establish a relatively straightforward hardness result for fc-ECSM. 

Proposition 2. The 2-ECSM problem is APX-hard. 

The key step is to show that 2-ECSM and metric 2-ECSS are basically the same problem. First, we 
use the following well-known fact: in fc-ECSM, the input is metric without loss of generality [29] (i.e. the 
graph is complete and its costs satisfy the triangle inequality). 1 Then, simple reduction techniques show 
that under metric costs, any 2-ECSM can be efficiently converted to a 2-ECSS without increasing the costs. 
We remark that this approach also yields a simpler 3/2-approximation for 2-ECSM (c.f. [9]), using the 
3/2-approximation for metric 2-ECSS [24] as a black box. 

What Proposition 2 leaves to be desired is hardness for fc-ECSM, fc > 2, and asymptotic dependence on 
fc. Why is it hard to show these problems are hard? The hard instances for fc-ECSS given by Theorem 1 and 
[26] contain certain mandatory parts that are "without loss of generality" included in the optimal feasible 
solution; the argument proceeds to show hardness of the residual problem once the mandatory parts are 
included. But coming up with suitable mandatory parts for fc-ECSM, while keeping the residual problem 
hard, is tricky: e.g. the proof of Theorem 1 will use a spanning tree of zero-cost edges, but in fc-ECSM 
this leads to a trivial instance (buy that spanning tree fc times). The known hardness for fc-VCSS (vertex 
connectivity) by Kortsarz et al. [34] is similar: we take hard instances of 2-VCSS and add (fc — 2) new 
vertices, connected to all other vertices by 0-cost (mandatory) edges. A new trick seems to be needed to get 
a good hardness result for fc-ECSM. 

1.1.2 fc-ECSM Conjecture (Section 3) 

We conjecture that approximation ratio 1 + 0(1/ k) should be possible for fc-ECSM, using LPs. Obtain the 
natural LP relaxation of fc-ECSM by allowing edges to be purchased fractionally: introduce a variable x e for 
each edge, and require that there is a fractional value of at least fc spanning each cut (see Figure 1, where 
S(S) denotes the set of edges with exactly one end in S). 

Conjecture 3. There is a polynomial-time approximation algorithm for k-ECSM which produces a solution 
of value at most (1 + C/fc) ■ OPT(A4) for some universal constant C. 

This conjecture implies a (1 + C/fc)-approximation algorithm, since OPT(A4) is a lower bound on the 
optimal fc-ECSM cost. What makes us think Conjecture 3 is true? First, we know it holds for unit costs. 
Second, the same holds in related high- width problems; to explain, say an integer program has width W if 
in every constraint, the right-hand side is at least W times every coefficient. Multicommodity flow/covering 
problems in trees are closely related to (A4) via uncrossing (e.g. [31, 26, 25]) and they admit an LP- 
based 1 + 0(l/M /r )-approximation algorithm [33] (in that setting W is the minimum edge capacity). Similar 
phenomena are known for LP relaxations of other structured integer programs [14, 13, 39, 5]. In fc-ECSM the 
width is fc so one may view our conjecture as seeking integrality gap 2 and approximation ratio 1 + 0(\/W). 

1 To see this, take the metric closure (i.e. shortest path costs), solve it, and replace each Mu-edge in the solution with a 
shortest u-v path from the original graph; it is not hard to show this preserves fc-edge-connectivity. In fc-ECSS, note metricity 
is not WOLOG, since the replacement step here can introduce multiple edges. 

2 The integrality gap is the worst-case ratio of the integral optimum to the LP optimum. 
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Figure 1: The undirected relaxation for fc-edge connected spanning multi-subgraph. The unbounded version 
(A4) is on the left, the bounded version (A/jQ is on the right. They have the same value for metric costs, 
including all fc-ECSM instances. 



Later, we show an open problem of [4] — can every fc-edge connected graph be partitioned into two 
spanning (| — 0(l))-edge-connected subgraphs? — would imply a nonconstructive version of Conjecture 3. 
Few partial results towards Conjecture 3 are known: the integrality gap of (A/i) is 2(1 — 1/n) [29], and that 
of (A/2) is at most 3/2 [42]. For general k, the best integrality gap bounds known for (A/fc) come from the 
approximation algorithms [31, 29, 25, 26] mentioned earlier. 

One further motivation to investigate the conjecture has to do with the parsimonious property of Goemans 
& Bertsimas [29]. Using metricity and splitting- off, they showed the constraint Vi> G V : x(6(v)) = k can 
be added to (A/fc) without affecting the value of the LP (the strengthened LP (Afj.) is shown in Figure 1). 
As observed in [29], parsimony implies that Conjecture 3 would give a (1 + ^-approximation algorithm for 
subset k-ECSM, where we require edge-connectivity k only amongst a pre-specified set of terminal nodes 
(generalizing the Steiner tree problem). Thus even if we don't care about LPs a priori, they have algorithmic 
dividends in Conjecture 3. 

1.1.3 Complex Extreme Points (Section 4) 

In both of the LPs (A4) and (A/^), note that k serves only as a scaling factor: x is feasible for (A/i) iff kx is 
feasible for (A/fe). In fact, these LPs are well-studied: (A/i) is equivalent (by the parsimonious property [29]) 
to the undirected cut relaxation of the Steiner tree problem and (A/^) is the Held-Karp relaxation of the 
Traveling Salesman Problem. We demonstrate a family of extreme point solutions to these ubiquitous LPs 
which are more complex than were previously known. 

For a solution x, the support is the edge set {e | x e > 0}, and the support graph is the graph with vertex 
set V and the support for its edge set. The fractionality of x is min{x e | e G E,x e > 0}. 

Theorem 4. There are extreme point solutions for the linear program (A/^) with fractionality exponentially 
small in \V\, and whose support graph has maximum degree linear in \V\. 

The members of the family are also extreme point solutions for (A/2), since (A2) is a face of (A/2). 
The motivation for this theorem comes from a common design methodology in LP-based approximation 
algorithms [31, 26, 28, 40]: algorithmically exploit good properties of extreme point solutions. E.g., Jain's 
algorithm [31] uses the fact that when (A/fe) is generalized to skew-submodular connectivity requirements, 
every extreme solution x* has an edge e with x* > |. Hence, complex extreme points give some idea of what 
properties might or might not exist that can be exploited algorithmically. 

Theorem 4 significantly improves previous results in the same vein. (A long-standing conjecture that the 
Held-Karp relaxation (A/^) has integrality gap at most 4/3 has motivated some of the work, e.g. [12, 6].) 
Boyd and Pulleyblank [11] showed that for any even |V| > 10, there is an extreme point of (A/^) with values 
in {1/t, 2/t, 1 — 2/t, 1 — 1/t, 1}, hence fractionality 2/ (|V| — 4). Cheung [16] gave a family of extreme points 
on Q(t 2 ) vertices with maximum degree At + 2 and entries in {l/(2t+ 1), 1 — l/(2t + 1), 1}, for every integer 



3 



t > 1, hence maximum degree 6(VT^T) m the support graph. The construction in Theorem 4 was found 
with the assistance of computational methods, as we describe later. 

We remark that a manuscript of Cunningham & Zhang [19] observes that by gluing together copies of 
the Boyd-Pulleyblank construction as blocks (2- vertex-connected components), one can get extreme points 
of (A/2) with denominator min{t £ Z>o | tx integral} of value f2(-y/| V|!). However, the fractionality is no 
worse than that of the Boyd-Pulleyblank construction, and gluing does not work for (A/^). Plus, insofar as 
we care about designing approximation algorithms, we may well solve fc-ECSS separately on each block, so 
this does not shed light on limits of the LP-based approach. 

2 Hardness Results 

In our hardness theorem for fc-ECSS, we reduce from the following problem. (Here t±l denotes disjoint union.) 
Path-Cover-of-Tree 

Input: A tree T — (V,E) and another set X C (^) of edges/pairs. 
Output: A subset of Y of X so that (V, E \£Y) is 2-edge-connected. 
Objective: Minimize \Y\. 

Path-Cover-of-Tree is sometimes called the tree augmentation problem and a 1.8-approximation is pub- 
lished [20]; as an aside, it is basically equivalent to the special case of 2-ECSS where the input graph contains 
a connected subgraph of cost zero, plus some unit-cost edges. We give it the alternate name Path-Cover- 
OF-Tree because it is more natural for us to interpret it as covering a tree's edges with a minimum-size 
subcollection of a given collection of paths. To make this explicit, for an edge x = {u,v} G X let P x denote 
the edges of the unique u-v path in T. We rehash the proof of the following proposition since we will recycle 
its methodology. 

Proposition 5 (folklore). Y is feasible for Path-Cover-OF-Tree if and only if \J xeY = E- 

Proof. For every edge e of T, a fundamental cut of e and T means the vertex set of either connected 
component of T\e. 

Let 6f(U) denote S(U) in the graph (V, F). First, Y is feasible if |<5Ettir(C^)| > 2 for every set U with 
^ U C V. But |<5g([/)| is 1 when U is a fundamental cut and at least 2 otherwise; hence Y is feasible iff 
|<^f(C^)| > 1 for every fundamental cut U. 

Second, when U is a fundamental cut, say for an edge eG£, 6y(£/)| > 1 iff {J xeY Px contains e. Taking 
this together with the previous paragraph, we are done. □ 

Path-Cover-of-Tree is shown NP-hard in [23] and a similar construction implies APX-hardness - 
we give the proof in the appendix. As an aside, it is even hard for trees of depth 2; compare this with the 
depth- 1 instances which are in P since they can be shown isomorphic to edge cover. Now we prove the main 
hardness result: 

Theorem 1. Let it be NP-hard to approximate Path-Cover-of-Tree within ratio 1 + e. Then for all 
integers k > 2, it is NP-hard to approximate k-ECSS within ratio 1 + e, even for 0-1 costs. 

Proof. Let (T = (V,E),X) denote an instance of Path-Cover-of-Tree. We construct a fc-ECSS instance 
on the same vertex set, with edge set F. For each e 6 E, we put k — 1 zero-cost copies of the edge e into 
F. For each x £ X, put one unit-cost copy of the edge x into F. These are all the edges of F; and although 
(V, F) is a multigraph, we later show that this can be avoided. 

First we show the multigraph instance is hard. Clearly, there is an optimal solution for the fc-ECSS 
instance which includes all copies of the 0-cost edges. Let (fc — 1)E denote these 0-cost edges. The same 
logic as in the proof of Proposition 5 (analysis using fundamental cuts) shows that Y is a feasible solution 
for the Path-Cover-of-Tree instance if and only if (k — 1)E tfcl Y is a feasible solution for the /c-ECSS 
instance. Since costs are preserved between the two problems, it follows that an a-approximation algorithm 
for fc-ECSS would also give an a-approximation algorithm for Path-Cover-of-Tree, and we are done. 
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Finally, here is how we make (V, F) a simple graph: replace every vertex v 6 V of the tree by a (k + 1)- 
cliquc of 0-cost edges; replace every edge uv E E of the tree by any k — 1 zero-cost edges between the two 
cliques for u and v; replace each edge x S X by any unit-cost edge between the cliques for u and u. We 
proceed similarly to before: when U is a vertex set of the newly constructed graph, we see S(U) has at least 
k 0-cost edges unless U is a "blown- up" version of a fundamental cut (i.e., unless there is a fundamental cut 
Uq of T so that U exactly equals the set of vertices in cliques corresponding to Uq). As before, the residual 
problem assuming these edges are bought is the same as the instance (T,X) (in a cost-preserving way), so 
we are done. □ 



2.1 Hardness of 2-ECSM (Proof of Proposition 2) 

To show that 2-ECSM is APX-hard, we prove that it is "the same" as metric 2-ECSS, i.e. the special case of 
2-ECSS on complete metric graphs. Metric 2-ECSS is APX-hard by a general result of [7] 3 and so this gives 
us what we want. The key observation is the following. 

Proposition 6. In a metric instance, given a 2-ECSM (V,F), we can obtain in polynomial time a 2-ECSS 
(V, F') with c(F') < c(F), as long as \V\ > 3. 

In other words, parallel edges can be eliminated without increasing the cost. (A similar observation in [24] 
turns a 2-ECSS into a 2-VCSS for metric instances.) 

Proof. We may assume F is minimal, i.e. that deleting any edge from (V, F) leaves a non-2-edge-connected 
graph. This implies there are no parallel triples. Next, suppose there is a parallel pair between some vertices 
u and v. If there is any u-v path not using a uv edge, it is easy to see that deleting one of the parallel uv 
edges contradicts minimality. Therefore we may assume uv is a cut edge (bridge) of the simplification of 
(V, F); call this the bridge assumption. 

Since the graph is connected and \V\ > 3, at least one of u or v (say u WOLOG) has another neighbour 
to. By the bridge assumption v is not adjacent to w. We will argue that the set F' obtained by deleting a 
uv edge, a uw edge, and adding a vw edge, is still 2-edge-connected. Iterating this operation we are done 
(since the cost does not increase and the number of parallel pairs decreases each time) . 

Since (V, F) is 2-edge-connected, it has a u-w path P not using the deleted uw edge. By the bridge 
assumption, P does not use any uv edge. Note that |<5p/(S)| < |^f(»S')| only if S contains v and w but 
not u (or vice- versa). But then 5p'{S) contains the remaining uv edge and at least one edge from P. So 
\$F'{S)\ > 2 for all ^ S C V and we are done. □ 

Proof of Proposition 2. Since metric 2-ECSS is APX-hard [7], it is enough to show that any a-approximation 
algorithm for 2-ECSM gives an a-approximation for metric 2-ECSS. The metric 2-ECSS algorithm is: com- 
pute an a-approximately-optimal 2-ECSM F and apply Proposition 6 to get a 2-ECSS F' with c(F') < c(F). 
Using Proposition 6 a second time, and using the fact that every 2-ECSS is trivially a 2-ECSM, we see the 
optimal 2-ECSS and 2-ECSM values are the same. Hence F' is an a-approximately-optimal 2-ECSS, as 
needed. □ 



3 fc-ECSM Conjecture and Connectivity Decomposition 

Here is the conjecture made in the introduction. We will relate it to questions about graph decomposition. 

Conjecture 3. There is a polynomial-time approximation algorithm for k-ECSM which produces a solution 
of value at most (1 + C/k) ■ OPT (A/it) for some universal constant C. 

3 Here is a sketch for the reader, somewhat simpler than the more general results of [7]. Take a family of hard TSP instances 
with costs 1 and 2 [37]. Using a little case analysis, [7] shows that a 2-ECSS can be transformed to a Hamiltonian cycle (TSP 
tour) by repeatedly replacing two edges with one edge, which does not incerase the overall cost if edge costs are 1 and 2; so 
for these instances, TSP and 2-ECSS are the same. In particular on these (metric) instances, finding the min-cost 2-ECSS is 
APX-hard. 
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For positive integers A and B, define f(A, B) to be the least integer / so that every /-edge-connected 
multigraph can be partitioned into two spanning subgraphs, one j4-edge-connected and one £?-edge-connccted. 
Bang- Jensen and Yeo [4] ask the following question, which we call the splitting hypothesis: is there a constant 
C such that f{k,k) <2k + C for all integers kl It has consequences for Conjecture 3: 

Theorem 7. If the splitting hypothesis holds, then every k-ECSM instance has a solution with cost at most 
(1 + C/k) ■ OPT(A/fc), i- e - the integrality gap of (A4) is at most 1 + C/k. 

This would not prove Conjecture 3 due the lack of a polynomial-time algorithm; but one might guess that once 
the core combinatorial problem is solved, a polynomial-time implementation could be found, as happened 
in [14]. 

Before proving Theorem 7 we make some other remarks about /. The Nash-Williams/Tutte theorem 
implies f(A, B) < 2(A + B). The lower bound f(A, B) > A + B is very easy, by considering an (A + B — Ir- 
regular, (A + B — l)-edge-connected graph. This lower bound can be raised by 1 or 2 in a few cases, e.g. the 
fact that a spanning tree has average degree almost 2 implies that f(A, 1) > A + 2 and /(l, 1) > 4. An 
example of another small improvement is that any d-regular, d-edge-connected graph with no Hamilton path 
implies f(d — 2, 1) > d + 1 since, were it to contain a spanning tree disjoint from a (d — 2)-edge-connected 
subgraph, that tree would have maximum degree 2 and hence be a Hamilton path. Such a graph is known 
to exist at least when d = 3 [41, Fig. 5.4] and by taking parallel copies we also get one for any d which is 
a multiple of 3. For any d > 3 there is a d-regular, c?-edge-connected graph with no Hamiltonian cycle [3G], 
similarly implying f(d — 2, 2) > d + 1. 

It seems the only value of / known exactly is /(l, 1) = 4. M. DeVos asked online whether \/A,B : 
f(A, B) < A + B + 2 holds, which is still open. 

Proof of Theorem 7. Let x* be an optimal extreme point solution to (A4). Since x* is rational, there is 
an integer t such that tx* is integral. Then, it is easy to see that tx* (or more precisely, the multigraph 
obtained by taking tx* copies of each edge e) is a tfc-edge-connected spanning multisubgraph. Likewise, for 
any positive integer a, atx* is a (aifc)-ECSM. 

By induction, the splitting hypothesis easily gives the following. 

Claim 8. For all positive integers k and n, every (2 n (k + C) — C)-ECSM can be decomposed into 2™ disjoint 
k-ECSMs. 

Now, for any integer n, let us pick a just large enough that atk > (2 n (k + C) — C). Therefore, atx* can 
be decomposed into 2 n disjoint fc-ECSMs. The cheapest one has cost at most 

4£1 = at2 -e(*-) = r ( * + a C) -> -OPT W ). 

Then using p 2 "^^" ] < [ 2 " ( ^+ c) l < 2 " ( ^ C) + 1, we see there is a fc-ECSM with cost at most 
(2 n {k + C) 



, 1 t2 -n OPT(jVfc) = (1 + C/k + i/2")OPT(A4)- 
V tk I 

This establishes that the integrality gap is no more than 1 + C/k + t/2 n . Taking n — > oo, we are done (since 
the integrality gap is some fixed real, and since t doesn't depend on n). □ 

We feel strongly that the following holds. 
Conjecture 9. /(A, 1) = A + o(A). 

For example, given a 100-edge-connected graph, if we want to delete a spanning tree of our choice and keep 
high edge-connectivity, 49 hardly seems like the best possible. It is not too hard to see (using repeated 
splitting and merging) that the splitting hypothesis would imply f(A, 1) = A + 0(C\a.A) and hence prove 
this conjecture. 

Variants of / have received some attention. For edge-connectivity in hypergraphs, /(l, 1) is not finite [3]. 
It is not known whether the analogue of /(l, 1) in directed graphs is finite [4, 2]. 



http : //garden. irmacs . sf u. ca/?q=op/partitioning_edge_connectivity 
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4 Complex Extreme Points for (A/" 2 ') 



Now we give our construction of a new family of extreme points for the TSP subtour relaxation (A2); as 
mentioned earlier, it can be scaled by k/2 to give an extreme point for (Afj.) or (A4), which is relevant to 
LP-based approaches for /c-ECSM. 

Let Fi denote the ith Fibonacci number, where F± = Fi = 1. For a parameter t > 3, we denote the 
extreme point by x*. The construction is given in the list below and pictured in Figure 2. 

• For i from 1 to t, an edge (2i — 1, 2i) of ic*-value 1 

• For i from 2 to t — 1, an edge (1, 2i) of a;*-value F t _i/F t 

• An edge (1,2*) of ie* -value I/F4 

• For i from 3 to t, an edge (2i — 3, 2i — 1) of x*-value F t -i+i/F t 

• For i from 3 to t, an edge (2i — 4, 2i — 1) of x*-value 1 — F t -i+2/F t 

• An edge (2,3) of x*-value F t -i/F t 

• An edge (2t - 2, 2t) of x*-value 1 - 

The support graph of ie* has 2t vertices and At — 3 edges with fractionality and maximum degree t. 
Therefore, in order to prove Theorem 4, it suffices to show that x* is an extreme point solution. 

Proposition 10. The solution x* described above is an extreme point solution for (A2). 

Proof. With foresight, we write down the following family of At — 3 sets: 

£:= mtviZi- 1, 20U {I,---, 2i}^}- 

The plan of our proof is to first show that ie* is the unique solution to {x(S(T)) = 2 | T G £}. It is easy 
to verify that x* indeed satisfies all these conditions, so let us focus on the harder task of showing that x* 
is the only solution. (Note, we are not assuming that x* is feasible, so possibly x*(S(S)) < 2 for some other 
sets, but we will deal with this later.) 

A set S is tight for a solution x if x(S(S)) = 2. Consider any solution which is tight for all sets in C. We 
first need a simple lemma. For disjoint sets S, T, let 6(S : T) denote the set of edges with one end in S and 
the other in T. 

Lemma 11. For some solution x, if S,T are disjoint tight sets and SUT is also tight, then x(5(S : T)) = 1. 

Proof. We have S(S) = S(S : T) W S(S : V\S\T) and 5(T) = S(S : T) W 6(T : V\S\T). Also, 5(S U T) = 
5(S : V\S\T) W S(T : V\S\T). Thus 2 = x(S(S)) + x(6{T)) - x(S(S U T)) = 2x{6{S : T)). □ 

Consider a hypothetical solution x with x(S(S)) — 2, Vie G C. The lemma shows all edges {2i — 1, 2i}' =1 
have a;-value 1 (take S = {2i — 1},T = {2i}). Define yi equal to a?(2i+i,2i+3) f° r * from 1 to t — 2. The 
degree constraint at 3 (i.e., x{8{2>)) = 2) forces £(2,3) = 1 — yi- The degree constraint at 2 forces £(5,2) = 2/i- 
Note {1, . . . , 2t — 2} is tight since this set has the same constraint as {2t — 1, 2t}. For i from 1 to t — 2, note 
that the sets <5({1, . . . , 2i} : {2i + 1, 2i + 2}) and S(2i + 1) differ only in that the former contains the edge 
(2i + 2, 1) and the latter contains the edges {(2i + l,2i + 2), (2i + l,2i + 3)}. Thus, using the lemma and 
degree constraint at 2i + 1, we see £(21+2,1) + £(2i+i,2i+3) = Hi- The degree constraint at 2i + 2 then forces 
iE(2i+2.2i+5) = 1 — 2/i for 1 < i < t — 3. The degree constraint at 2t — 2 forces £(1,24-2) = 1 — 2/t-2! the degree 
constraint at 2t forces £(i,2t) = 2/t-2- The degree constraint at 2t — 1 forces 2/4-2 = 2/t-3, and the degree 
constraint at 2i + 5 forces j/j = Vi+i + J/i+2 for i from 1 to t — 4; together this shows yi — F t -\-i ■ yt-i for i 
from i — 4 to 1 by induction. The degree constraint at 5 forces 2yi + yi = 1, so (2F t -2 + ^4-3)2/4-2 = 1 and 
consequently yt-2 — 1/-Ft- Thus we conclude that desired. 

Now, we show x* is feasible using standard uncrossing arguments, plus the fact that \C\ = At — 3. In 
(A/^), the constraints for sets S* and V\S are equivalent. Therefore, if we fix any root vertex r G V, we may 
keep only the constraints for sets S not containing r without changing the LP. Correspondingly, we change 
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Figure 2: Our new construction of a complex extreme point x* for the subtour TSP polytope (A/<j), illustrated 
for t — 15. Scaled edge values are shown: the label Fi on an edge e indicates that x* e — Fi/F t . The symbol 
d denotes F t ~F t , i.e. an edge e with x* e = 1 - (Fi/Ft). 
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C by complementing the sets that contain r, and it is easy to see C is a laminar family on V\{r}. (This is 
along the lines of the standard argument by Cornuejols et al. [18].) In fact £ is a maximal laminar family, 
since any laminar family of nonempty subsets of X contains at most 2\X\ — 1 elements, for any set X. 

Finally, suppose for the sake of contradiction that x* is not feasible, so there is a set S, with r $ S, 
having x*(S(S)) < 2. Clearly S $ C. Two sets S, T, neither containing r, cross if all three of S\T, T\S, and 
T n S are non-empty. Take S with x*(S(S)) < 2 such that S crosses a minimal number of sets in C. If S 
crosses zero sets in C, then C U {S} is laminar, but this is a contradiction since S $ C and, crucially, C was 
maximal. Otherwise, set S crosses some tight set T 6 C, then since 

2 + 2 > x*(5(S)) + x*(6(T)) > x*{S{S U T)) + x*{6{S n T)), 

either x*(5(S U T)) < 2 or x*(S(S n T)) < 2. It is easy to verify that both S U T and 5 CI T cross fewer sets 
of £ than S, contradicting our choice of S. □ 

4.1 Methodology 

To investigate extreme points of (TV^), we first used computational methods to try to find the most "inter- 
esting" small examples. There are a number of properties that the support graph must have, e.g. no more 
than 2n — 3 edges, 3-vertex-connected (or else it is essentially a 2-sum of smaller solutions), and our method 
was to compute all extreme points on all such graphs. See Boyd [6, 10] for more discussion of how these 
steps can be implemented. We used nauty [35] to generate the graphs, and the Maple package convex [22] to 
enumerate extreme points. The Maple package available at the time did not have a good interface for laying 
out graphs, so we created a procedure [38] to export the graphs to GeoGebra [30], which is well-suited for 
layout (and exporting for diagrams in this document). We found the following interesting examples, which 
are pictured in Figure 3. Note "unique" means unique up to graph isomorphism. 

(a) for n < 6, there is a unique extreme point with denominator > 2 

(b) for n < 7, there is a unique extreme point with maximum degree > 4 

(c) for n < 8, there is a unique extreme point with denominator > 3 

(d) for n < 9, there is a unique extreme point with maximum degree > 5 

(e) for n < 9, there is a unique extreme point with denominator > 4 

(f) for n < 10, the maximum degree that occurs is 5 and the maximum denominator is 5; there is a unique 
solution on 10 vertices that attains both simultaneously 

We found that there was some primal structure and dual structure to the 10- vertex example which was 
shared with the smaller examples (a) and (c); these observations led to the family described in Section 4. 
We remark that the extreme points pictured, and more generally our new construction, do not coincide with 
the families of Boyd and Pulleyblank [11] or Cheung [16] for any choice of parameters. 

4.2 Discussion 

The construction given shows that extreme points on n vertices of the Held-Karp relaxation may have 
maximum support degree as big as n/2 and fractionality as small as l/F n /2, for even n. A natural question 
is whether these bounds are maximal. Boyd, with Benoit [G] and Elliott-Magwood [10], has computed and 
posted online [8] a list of all vertices of the subtour elimination polytope for up to 12 vertices. Filtering 
through that data, we find the following facts. 

Remark 12. For 11-vertex solutions, the largest maximum degree is 6, the largest denominator is 8, and of 

11- vertex solutions with maximum degree 6, the maximum denominator is 5 which is uniquely attained. For 

12- vertex solutions, the largest maximum degree is 6, the largest denominator is 9, and of 12-vertex solutions 
with maximum degree 6, the maximum denominator is 8 which is uniquely attained. 
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Figure 3: Six extreme points for the subtour TSP polytope (A/^) with extremal properties. 
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Hence for even n, F n / 2 is not the maximum possible denominator. Based on the available data, we 
conjecture the following. 

Conjecture 13. The maximum degree of extreme points on n vertices is exactly \n/2~\ . 

The best upper bound we are aware of is n — 3, which follows from the fact that each basic solution has 
at most 2n — 3 edges, plus an easy argument to eliminate degree-2 vertices. 

4.3 Relation to Asymmetric TSP 

Asymmetric TSP is the analogue of TSP for directed graphs: we are given a metric directed cost function on 
the complete digraph (V,A), and seek a min-cost directed Hamiltonian cycle. Recently Asadpour et al. [1] 
obtained a breakthrough 0(logn/loglogn) approximation for this problem; its analysis uses the fact that 
extreme points of the natural LP relaxation 

{yeRi :V0^C/ CV, y(5° ut (17)) > 1} {A) 

have denominator bounded by 2°( nlnn \ Our undirected construction implies that for this directed variant, 
the extreme points attain denominator at least 2 n ( n \ 

Proposition 14. For even n > 6 there are extreme points for (A) on n vertices with fractionality \jF n i 2 
or smaller (and hence denominator at least F n / 2 ). 

Proof. The key is to note that (A/2) equals the projection of (A) to obtained by setting xs u v -j — 
V(u,v) + U(v,u) for all {u,v} € (^) (call this map dropping directions). One direction is evident: given y, it 
has value at least 1 both coming into and coming out of every nontrivial cut set £/, hence its undirected 
image x has value at least 2 spanning the cut it defines, i.e. x(S(U)) > 2. Conversely, to show that for every 
x G (A/2), there is a y 6 (.4) of this type, just assign = y( v , u ) — X{ u v }/2 for all {u,v} 6 (^). 

Now we prove Proposition 14. Consider x* given by the construction, and consider the set of all y in (.4) 
such that y becomes x* when dropping directions. The argument in the previous paragraph establishes that 
this set is nonempty, and it is not hard to see this set is a face of (.4) since x* is an extreme point of (J\f 2 ). 
Finally, let y* be any extreme point of this face. Our construction includes an edge e with x* e = l/F n / 2 , 
hence at least one of the two arcs corresponding to e has y*-value in (0, l/F n / 2 ], giving the claimed result. 

As a remark, the above proof leaves open the possibility that the extreme points y* for („4) could have 
strictly worse fractionality than l/F n / 2 , but according to our computational experiments for n = 6,8, the 
worst-case fractionality for such y* is exactly l/F n / 2 - □ 

4.3.1 Integrality Gap 

Several papers of Boyd and coauthors investigate TSP LP extreme points with the goal of lower-bounding 
the integrality gap, therefore it is natural to ask what integrality gap is implied by the construction given in 
this paper. It does not appear that our construction gives a good integrality gap lower bound; for 6, 8, 10, 12 
vertices we have computed that the integrality gap obtained is only §, ff , §§, §§■ (Specifically, this value is 
the least t > such the extreme point is dominated by t times a convex combination of indicator vectors of 
Hamiltonian cycles.) 
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A Hardness of Path- Cover-of- Tree 

Our arguments are based on those of [23], and also inspired by [27], who used the same approach to prove 
APX-hardness of a related packing problem. We reduce from minimum set cover in 3-uniform, 2-regular 
hypergraphs — i.e. set cover with sets of size 3, each set appearing in exactly 2 sets — which is equivalent 
to vertex cover in cubic graphs. The best known inapproximability ratio for this problem is about ^jp, due 
to Chlebfk and Chlebikova [17]. 

Here is the reduction. Let the instance of 3-uniform, 2-regular set cover be (J, K.) where J is the ground set 
and K is the family of triples from J. Let k = |/C| (so |J| = 3fc/2) and denote the sets by Ki = {a[i}, b[i], c[i]} 
for 1 < i < k (so a[i], b[i], c[i] are elements of J). The tree T we construct for the Path-Cover-OF-Tree 
instance has a root vertex r, a vertex Vj for each j £ J, and two vertices pi, qi for 1 < i < k; T has an edge 
{r, Vj} for every j £ J, and the two edges {v a ^ ,pt}, {v a [{\ , qi} for 1 < i < k. Finally, we define the set X to 
have the following 3fc pairs: {Pi,<7;}, {Pi,v b ^}, {qi,v c ^} for 1 < i < k. 

Claim 15. OPT(T, X) = k + OPT( J, K). 

(We speak of Path-Cover-OF-Tree in terms of covering E(T) instead of as a 2-connectivity problem.) 

Proof. Let {K t \ i £ 1} be an optimal set cover, i.e. a J-covering subfamily of K. such that |/| = OPT( J, JC). 
Define Fclas follows: if i £ I we put {pi, Uf,r,-i } and {q i7 v c ^} into Y, and if i I we put {pi, q{\ into Y . 
In either case, the corresponding paths in T cover the edges incident to pi and qf, and it is not hard to see 
that since / is a set cover, all edges incident to r are also covered. This proves OPT(T, X) < 2\I\ + (k— \I\) = 
fc + OPT(J,/C). 

The reverse inequality is similar. The only step needing pause is to consider whether (T, X) always has 
an optimal solution Y of the form generated by the above mapping (since then it can be reversed) . Indeed, 
if Y contains one or fewer of the 3 pairs {pi, qt}, {pi, v^}, {qi, v c [{\\ then it must contain {pi, q{\ to cover the 
edges incident to pi and qf, and if Y contains two or more of the pairs, we can adjust such pairs to {pi, i^m} 
and {<Zi,v c [i]} without increasing |Y| and without causing an edge of T to become uncovered. □ 

Here are the calculations that show the reduction works. We have OPT( J, JC) > k/2 (since we need 
to cover 3fc/2 points by triples), and by the result of [17], no polynomial-time algorithm can determine 
OPT( J, K) within additive error on all instances, unless P = NP. Hence, no polynomial-time algorithm 
can determine OPT(T, X) within the same additive error. Finally, since OPT(T, X) < 2k, we get an 
inapproximability ratio of 1 + ^|g/2fc = 1 + g|g for Path-Cover-OF-Tree. However, if we actually look 
at the gap instances of [17], the same calculations give a slightly stronger ratio of 1 + 5^4- 
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